Estimating the Vocal-Tract Area Function From Formants Using a Sensitivity Function and Least Square
نویسندگان
چکیده
We present a method for estimating the vocal-tract area function from specified formant frequencies. The method extends the work of Story (J.A.S.A., 119, 715-718, 1996) based on a sensitivity function representing the change in the formant frequency due to a small perturbation of the cross-sectional area of the vocal tract. Our method estimates the vocal-tract shape through an iterative procedure in which the sensitivity function is used as the basis function to gradually optimize the cross-sectional area that produces the target formant frequencies. In addition, the summing weight of sensitivity functions is determined by minimizing an objective function representing the relative frequency error of every format. We conducted numerical experiments using area function data of English vowels. Results showed that our method can estimate the vocal-tract shape with satisfactory accuracy. In addition, the number of iterative calculations is significantly lower than with Story’s original method.
منابع مشابه
Estimation of vocal-tract shape from speech spectrum and speech resynthesis based on a generative model
Precise control of articulatory parameters is difficult and prevents a physical model from generating natural sounding speech signals. To determine vocal-tract shape from speech, this paper presents an inversion method for simultaneously estimating the cross-sectional area and length of the vocal tract. In addition, we performed speech resynthesis from a time-series of estimated vocal-tract sha...
متن کاملMappings between vocal tract area functions, vocal tract resonances and speech formants for multiple speakers
This study looks at mappings between vocal tract area functions (obtained from MRI scans), vocal tract resonances, and speech formants for five New Zealand English (NZE) speakers. All eleven NZE monophthongs were investigated, for each speaker. Principal component (PC) analysis on the area functions of both the individual speakers and combined speaker set is performed. In all cases the first tw...
متن کاملMeasurement of temporal changes in vocal tract area function from 3D cine-MRI data.
A 3D cine-MRI technique was developed based on a synchronized sampling method [Masaki et al., J. Acoust. Soc. Jpn. E 20, 375-379 (1999)] to measure the temporal changes in the vocal tract area function during a short utterance /aiueo/ in Japanese. A time series of head-neck volumes was obtained after 640 repetitions of the utterance produced by a male speaker, from which area functions were ext...
متن کاملVoice morphing based on interpolation of vocal tract area functions using AR-HMM analysis of speech
This paper presents a new voice morphing method which focuses on the continuity of phonological identity overall interand extra-polated regions. Main features of the method are 1) to separate the characteristic of vocal tract area resonances from that of vocal cord waves by using AR-HMM analysis of speech, 2) interpolation in a log vocal tract area function domain and 3) independent morphing fo...
متن کاملContinuous Voice Morphing Using Separated Vocal Tract Area Functions and Glottal Source Waves
This paper presents a flexible voice morphing method, which is based on a conversion using a linear combination of the vocal tract area functions estimated from speech signals. The method focuses on the continuity of the phonological identity of the overall interpolated area. The main features of the method are 1) to separate characteristics of the vocal tract resonances from those of glottal s...
متن کامل